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We present a rigorous mathematical framework for analyzing dynamics of a broad class of Boolean network 
models. We use this framework to provide the first formal proof of many of the standard critical transition 
results in Boolean network analysis, and offer analogous characterizations for novel classes of random Boolean 
networks. We precisely connect the short-run dynamic behavior of a Boolean network to the average influence of 
the transfer functions. We show that some of the assumptions traditionally made in the more common mean-field 
analysis of Boolean networks do not hold in general. For example, we offer some evidence that imbalance, or 
expected internal inhomogeneity, of transfer functions is a crucial feature that tends to drive quiescent behavior 
far more strongly than previously observed. 
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Introduction. Complex systems can usually be repre- 
sented as a network of interdependent functional units. 
Boolean networks were proposed by Kauffman as models of 
genetic regulatory networks [TTl and have received consider- 
able attention across several scientific disciplines. They model 
a variety of complex phenomena, particularly in theoretical 
biology and physics |[3}j8l. 

A Boolean network ,yV with n nodes can be described by 
a directed graph G — {V,E) and a set of transfer functions. 
We use V and E to denote the sets of nodes and edges respec- 
tively, and denote the indegree of node / by Ki. Each node 
/ is assigned a Ki-aiy Boolean function /, : {— 1,+1}^' -^ 
{— 1,+1}, termed transfer function. If the state of node / at 
time t is x,(f ), its state at time f + 1 is described by 



x,-(f + !)=/,■ (x,-,(f),. 



^'Kj 
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The state of ^ at time t is just the vector 

{xi{t),X2{t),...,X„{t)). 

Boolean networks are studied by positing a distribution of 
graph topologies and Boolean functions from which indepen- 
dent random draws are made. We denote the distribution of 
transfer functions by £^. An early observation was that when 
the indegree of a network is fixed at K and each transfer func- 
tion is chosen uniformly randomly from the set of all ^-input 
possibilities, the network dynamics undergo a critical transi- 
tion at K = 2, such that for K < 2 the network behavior is 
quiescent and small perturbations die out, while for K > 2 it 
exhibits chaotic features [2|. This result has been generalized 
to non-homogeneous distributions of transfer functions, when 
the output bit is set to 1 with probability p (called bias) in- 
dependently for every possible input string t9J. The resulting 
critical boundary is described by the equation 2Kp{\ ~p) — \. 

All analysis of Boolean networks to date uses mean-field 
approximations, an annealed approximation [9J, simulation 
studies |[T]|2l, or combinations of these, to understand the dy- 
namic behavior Many previous studies rely solely on short- 
run characteristics (e.g., Derrida plots that consider only a 
very short, often only a single-step, horizon |l4l|5]|7l) and ex- 
trapolate to understand long-term dynamics. Hamming dis- 
tance between Boolean network states that diverges exponen- 
tially over time for small perturbations to initial state sug- 
gests sensitivity to initial conditions typically associated with 
chaotic dynamical systems. Nonetheless, the connection be- 



tween short-run and long-run sensitivity is not a foregone con- 
clusion [lOJ and remains an open question. 

We provide a formal mathematical framework to analyze 
the behavior of Booleam networks over a logarithmic (in the 
size of the graph) number of discrete time steps, and give 
conditions for exponential divergence in Hamming distance 
in terms of the indegree distribution and influence of transfer 
functions in ,"7 . 

Assumptions. We assume that the Boolean network .A" is 
constructed as follows. First, we specify an indegree distribu- 
tion 2i with a maximum possible indegree K,nax, and for each 
node / independently draw its indegree Ki ^ '3. We then con- 
struct G by choosing each of the Ki neighbors of every node / 
uniformly at random from all n nodes. Next, for each node / 
we independently choose a /r,-input transfer function accord- 
ing to 5^. We assume that the family 3' has either of the 
following properties: 

• Full independence: Each entry in the truth table of a 
transfer function is i.i.d., or 

• Balanced on average: Transfer functions drawn from 
S^ have, on average, an equal number of +1 and —1 
output entries in the truth table. Formally, ^rf_x\f(x) = 
+ 1] — 1/2, where Pr/.^ denotes the probability of an 
event when / is drawn from 3 , and input x for / is 
chosen uniformly at random. 

Influence. The notion of influence of variables on Boolean 
functions was defined by Kahn et al. ifTTI and introduced to 
the study of Boolean networks by Shmulevich and Kauff- 
man [4J. The influence of input / on a Boolean function /, 
denoted by lnf,(/), is 

Inf,(/)=Pr,[/(x)^/(x«)], 

where x^'' is the same as x in all coordinates except /. Given 
a distribution -3 of transfer functions, let J^ denote the 
induced distribution over cZ-input transfer functions. The 
expected total influence under 3^/, denoted by I{,%), is 
Ey^,^[^,lnf,(/)]. When ,%i is clear from the context we 
write this simply as I{d). Suppose that we have an indegree 
distribution where p{d) is the probability that indegree is d. 
We show that the quantity that characterizes the dynamic be- 



havior of Boolean networks is 

^ = £ Pid)iid). 

d=\ 

Main Result. We present our main result that character- 
izes dynamic behavior of Boolean networks under the as- 
sumptions stated above. Define f* = logn/(41og/r,„av). The 
following theorem tracks the evolution of Hamming distance 
up to time t*, starting with a small (single-bit) perturbation. 
We note that our theorem applies for any distribution of inde- 
grees with a maximum bounded by K,,uix, though increasing 
density {K„uix) shortens the effective horizon t* . 

Theorem 1 Choose a random Boolean network ^/K having a 
random graph G with n nodes and a distribution of trans- 
fer functions 3'. Evolve t/K in parallel from a uniform ran- 
dom starting state x and its flip perturbation x-^ {with a uni- 
form random i). The expected Hamming distance between 
the respective states of .jV at time t < t* lies in the range 

^'±1/«V4. 

The proof of this theorem is non-trivial and is provided in 
the supplement. It shows that the effects of flip perturbations 
vanish when J^ < 1 while perturbations diverge exponentially 
when y > I. Thus, criticality of the system is equivalent to 

y = i. 

Much of the past work assumed (or explicitly stated) that 
it suffices to consider the expected influence value I{K) for 
the mean indegree K. A direct consequence of Theorem [T] is 
that I{K) characterizes a critical transition iff I{d) is affine. 
To see this, observe that I{K) = .f iff I{K) =I(Zddpid)) = 
Y.dP{d)I{d). This is true if and only \f I{d) is affine. 

Applications. In this section we use Theorem[T]to recover 
most of the characterizations of critical indegree thresholds 
to date and prove results for new natural classes of transfer 
functions. We show that our assumptions are crucial in ob- 
taining the observed results. An important step in applying 
the theorem is computing the quantity I{d) for a given class 
of transfer functions ,^. The following proposition (proven in 
the online supplement) facilitates this process. Let .S§'^' denote 
a ^/-dimensional Boolean hypercube. The edges of ^'^ con- 
nect pairs of elements with Hamming distance 1 . A function 
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' can be represented by labeling element x e ^ 



by f{x). An edge of ^'' is called f-bichromatic if one end- 
point is labeled +1 and the other —1. 

Proposition 2 Consider a distribution S'd over d-input func- 
tions. Then 



K^d. 



E^^^ [# f-bichromatic edges] 



Uniform random transfer functions. We begin with the 
classical model of random Boolean networks in which each 
entry in the truth table of a transfer function is chosen to be 
+ 1 and —1 with equal probability. It has previously been ob- 
served that the critical transition occurs at mean indegree K — 
2 |l9l. We now demonstrate that it is a simple corollary of our 
theorem. First, we need to compute I{d) using Proposition [2] 



In this model, the probability that an edge is /-bichromatic 
is exactly 1/2. Hence, I{d) — (total number of edges)/2''. 
Since the total number of edges (of .'3S'^) is d2'^'^\ we obtain 
I{d) — d/2. Notice that I{d) is linear in this case, and, conse- 
quently, considering I{K) = K/2 suffices for any distribution 
p{d). Applying Theoremfllthen gives us the well-known crit- 
ical transition at /T = 2. 

Transfer functions with a bias p. A simple generalization 
of uniform random transfer functions is to introduce a bias, 
that is, a probability p that an entry in the truth table is +1 
(but still filling in the truth table with i.i.d. entries) f2l. In this 
case, the probability that an edge is /-bichromatic is 2/?(l —p) 
and therefore I{d) = 2dp{\ — p). Since I{d) is linear, we can 
characterize the critical transition in this case at 2Kp{\ — p) = 
1 for any indegree distribution with mean K. 

Canalizing functions. Kauffman ||2| and others have ob- 
served that since uniform random transfer functions are typ- 
ically chaotic, they are unlikely to represent a distribution 
of transfer functions that accurately models real phenomena, 
such as genetic regulatory networks. Biased transfer functions 
only partially resolve this, as they still tend to fall easily into 
a chaotic regime for a rather broad range of /? IS). Empirical 
studies of genetic networks suggest another class of transfer 
functions called canalizing. A canalizing function has at least 
one input, /, such that there is some value of that input, v,, that 
determines the value of the Boolean function. Shmulevich 
and Kauffman |4 1 show heuristically that canalizing functions 
have I{K) = {K+l)/4 and thus exhibit a critical transition at 
K = 3. We now show that this is a corollary of our theorem, 
using Proposition [2] to obtain I{d). 

To compute I{d), fix (without loss of generality) the canal- 
izing input index to be 1 and the canalizing input and output 
values to +1. Consider the distribution of functions condi- 
tional on these properties. By symmetry, the expected num- 
ber of bichromatic edges conditional on this is the same as 
the overall expectation. Hence, we can focus on choosing / 
from this conditional distribution. Split the hypercube ^'^' into 
the {d— 1) -dimensional sub-hypercubes ^' and 3§" such that 
£§' has all inputs with xi = +1 and ^" has all inputs that 
have xi = —1. Edges can be partitioned into three groups 
E\E",E*. The set of edges £' (resp. E") are those that are 
internal to ^' (resp. ^"). The set of edges E* have end- 
points in both ^ and M'. Note that \E'\ = \E"\ ^ {d -1)2''-^, 
and \E*\ = 2''^'. Because the function is canalizing, the 
edges in E' are all /-monochromatic, and all other edges 
are /-bichromatic with probability 1/2. Hence, the expected 
number of bichromatic edges is {{d — 1)2'^'^^ +2''^')/2 = 
2''''^{d+l)/4. By Proposition m we then have I{d) ^ {d + 
l)/4. Since this is affine in d, we can conclude that I{K) — 
{K + l)/4 characterizes the short-run dynamic behavior for 
any indegree distribution with mean K. 

Threshold functions. A threshold function f{x) with d 
inputs has the form sgn[/*(jc)] with 



/*W = ;,L^'-^'~^' 



i<d 



where jc, is the value of input /, w, G {— 1,+1} is its weight, 
which has a natural interpretation of an input being inhibit- 
ing (wj = —1) or excitatory (w,- = +1) in regulatory networks, 
and is a real number in [—1,+!] representing an inhibit- 
ing/excitatory threshold for /. Such 2-input threshold func- 
tions have been studied by Greil and Drossel lfT2l and Sze- 
jka et al. |[T3l and are classified as biologically meaningful 
by Raeymaekers fT4|. We now use Theorem [T] to show that 
random threshold functions lead to criticality for any indegree 
distribution. 

Consider 3^ in which the value of w,- for each input /, as 
well as 9, are chosen uniformly at random. To compute I{d), 
consider a threshold function with threshold 9 and an edge 
{x,x^''). This edge is bichromatic exactly when the 9 lies be- 
tween f{x) and /(.x(')). Note that |/* (x) - /* (x^ ) | = 2/d, 
regardless of the values wi,... ,Wci- Since the range of 9 has 
size 2, the probability that this happens is {2/d)/2 — l/d. So 
I{d) = (# of edges )/ii2''^' ~ 1. Since it is independent of li, 
the result follows immediately by Theoremfl] 

Majority function. An important specific threshold func- 
tion is a majority function, which has w,- = 1 for all inputs / 
and 0=0. Suppose 3^ consists exclusively of majority func- 
tions. We demonstrate that the quiescence-chaos transition 
properties of this class are very different from those of gen- 
eral threshold functions. One detail that needs to be specified 
for ^ is what to do when the number of positive and negative 
inputs is exactly balanced. To satisfy the condition that ^ is 
balanced in expectation, we let the output be +1 or — 1 with 
equal probability in such an instance (for a specific majority 
function this choice is determined, but it is randomized for any 
majority function generated from ^). Given this ^, we now 
show that 
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When d is odd, bichromatic edges are those that connect the 
[d/2j -level to the [£//2] -level. For d even, these are the 
edges connecting the ii/2-level to the {d/2 — 1) -level (or the 
{d/2 + l)-level). In either case, the number of these edges 
is [(i/2] (, ^21), giving I{d) as above. Consequently, when 
£/ = 1 or 2, 1{d) = 1, while for d>3, 1{d) > 3/2. Thus, if a 
Boolean network has a fixed indegree K, it is critical for K <2 
and chaotic for K > 2. 

Strong majority function. We now show an interest- 
ing and natural class of functions where the expected aver- 
age influence goes down as the indegree d increases. Con- 
sider threshold functions where w, = 1 for all inputs / and the 
threshold is either or —0 with equal probability for some 
fixed e [0, 1]. For example, when = 1/3, the function re- 
turns + 1 iff a 2/3 majority of inputs have value +1. For this 
class of functions, bichromatic edges are those that connect 
the [(i/2 + prfJ -level to the [ii/2 + p(i] -level, where p = 0/2. 
Thus, the expected number of bichromatic edges for a fixed d 
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FIG. 1. The X-axis is K. The y-axis gives the average influence pa- 
rameter /(A"). We show the cases where p = 1/5, 1/6, 1/7. For larger 
p we reach quiescent behavior more rapidly with increasing K. 



and, consequently, I{d) — Be/2''^'. In FigurelTlwe plot I{K), 
where K is, a. fixed indegree, for different values of p . There 
are two rather remarkable observations to be made about this 
class of transfer functions: first, the sawtooth behavior of 
I{K), and second, that the Boolean network actually becomes 
more quiescent with increasing K. To our knowledge, this is 
the first example in which there is no single critical transi- 
tion from order to chaos, and increasing connectivity leads to 
greater order. We show that for d large enough, I{d) tends 
to 0. For convenience, assume that d is an even integer 
and 9d is non-integral. By tail bounds on binomial coeffi- 
cients, 2^''^r>i^/2+p(/j ('^.) < 2^"' for some constant c. (This 
can be proven using a Chernoff bound, such as Theorem 4. 1 
in ifTSl .) Hence I{d) < 1 for large enough d, and tends to 
zero as d increases. We had previously noted that it is com- 
monly assumed that I{d) is linear in d. Strong majority trans- 
fer functions feature I{d) that is clearly non-linear, and we 
therefore expect this assumption to be consequential. To il- 
lustrate, consider two network structures: one with a fixed 
K — 4, and another where the indegree distribution follows a 
power law with mean K — 4. Using = 1/3, in the former, 
we get J' = I{K) = 1.5, while in the latter (with K„,ax = 100), 
J^ = 0.79. Thus, while a fixed K yields decidedly chaotic dy- 
namics, using a power law distribution with the same mean 
indegree produces quiescence. 

Tlie importance of graph structure. Our results rely fun- 
damentally on the fact that the inputs into each node are cho- 
sen independently. The fact that the size of the neighbor- 
hood at distance t grows exponentially with t is crucial for 
our proofs. Furthermore (for the random graphs we sample 
from), this neighborhood is a root directed tree, when t < t*. 
When graphs exhibit only polynomial local growth, we do not 
expect chaotic dynamic behavior even when other conditions 
for it are met. We illustrate this point in Figure [2] (left), which 
compares a random network with /T = 4 to a grid (a bidirec- 
tional square lattice that also has K — 4). While both initially 
appear to be in a chaotic regime, the Hamming distance stops 
diverging for a grid, but diverges exponentially in the random 
network. 

The importance of being balanced. The assumption that 
,5^ is balanced is crucial. Balance has previously been noted to 
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FIG. 2. Boolean network dynamics (Hamming distance over time, starting with single-bit perturbations). Left: comparing random graph with 
K = A and a grid. Middle: random networks with fixed A" = 10 and unbalanced strong majorities. Right: random networks with A" = 5, using 
nested canalizing functions, with degree of imbalance increasing with a, and the empirical distribution of 5-input transfer functions based on 
yeast regulatory networks [!3]|71. 



play an important role in determining the order to chaos tran- 
sition, but entirely under the assumption that each truth table 
entry is i.i.d. |2l. It has been pointed out that much of the 
resulting space of parameter values gives rise to chaotic dy- 
namics IS) . What we now demonstrate is that this observation 
is largely an artifact of independence, and when truth table 
entries are not independently distributed, even a slight devia- 
tion from balance (homogeneity) may push Boolean network 
dynamics to quiescence. Consider networks in which every 
transfer function is a strong majority (with 0=0 being a sim- 
ple majority). We get a balanced distribution of transfer func- 
tion by choosing between Q and —0 with equal probability. 
An unbalanced distribution is obtained by choosing only one 
of them. Figurel2](middle) shows several examples of how the 
Hamming distance evolves for different values of 9, and con- 
trasts the balanced and unbalanced settings. The difference 
could hardly be more dramatic: even a slight deviation from 
simple majority (0 — 0.01) is a difference between chaos and 
quiescence; indeed, it is instructive to see the initial increase 
in Hamming distance for the unbalanced strong majority with 
9 = 0.01, only to be ultimately suppressed. Similarly, we can 
compare the balanced and unbalanced versions of strong ma- 
jorities with = 1/3: the balanced version is clearly chaotic, 
while in the network with the unbalanced analogue, initial per- 
turbation effects erode within two iterations. A similar picture 
emerges when we consider nested canalizing functions, previ- 
ously offered as an explanation of robustness in genetic regu- 
latory networks 1 3 , 7 1 . Classes of these are generated by a pa- 
rameter a that governs the fraction of 1 's in the transfer func- 
tion tiTith table, with larger values of a leading to greater im- 
balance. Figure|2](right) compares evolution of networks with 
nested canalizing functions, as well as with transfer functions 
following an empirical distribution of transfer functions based 
on regulatory networks 1 7 1 . We see that the main driver of qui- 
escence appears to be the internal inhomogeneity of transfer 
functions, rather than canalizing properties. 
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Supplemental Material 

The following supplemental material for the paper includes 
all formahzation and proof details. 



Preliminaries and notation 

We will be using probabilities very heavily, so it will help to 
set some notations. Capital letters X^Y,Z will used to denote 
random variables. Events are denoted by calligraphic letters 

<#", ^, The probability of an event S is denoted by Pr(<#). 

The expectation of a random variable X is denoted by E[X]. 
The random variable X conditioned on event S is denoted by 
X\S. 



Preliminaries 

Graphs: We will always deal with directed graphs 
G = iy^E). The set V will be [n], the set of positive integers 
upto n. For every / G V, N^{i) (resp. N^{i)) denotes the set 
of out (resp. in) neighbors of / in G. We use d^ (/) to denote 
the outdegree of / (similarly, define d^ (/)). 

Boolean strings and functions: We will use the set { — 1 ,+ 1 } 

(instead of {0, 1}) to denote bits, thereby aligning ourselves 
with the theory of Boolean functions. The n-dimensional 
Boolean hypercube is denoted by 3^" = { — 1,+!}". This is 
also a representation for all n bit strings. A Boolean function 
is a function f : .^" -^ £§K 

For any two elements x,y ^ 3§„, A{x,y) is the Hamming dis- 
tance between x and y. For x € ^", x^'' is the unique element 
of ^" which is the same as x on all coordinates except for the 
rth coordinate. We use x, to denote the ith coordinate of x. 

We use Prf(ff) (where (f is some event) to denote the 
probability of S" over the uniform distribution of x (which 
means we choose an x uniformly at random from -W and 
check the probability of <#). Similarly, Ev[. ..] denotes the 
expectation over a random uniform x. 

Boolean networks: A Boolean network ^ consists of 
a directed graph G and a set of transfer functions T. The set 
T has a transfer function Tv : ^'' ^'' — > {— 1,+1}, for each 

iev. 

The state of a Boolean network is just an assignment of 
{—1,4-1} to each vertex in V. This can be represented as an 
n-bit string, or alternately, an element x of the n-dimensional 
Boolean hypercube ^" = { - 1 , + 1 }" . 

Suppose o/K starts at the state x. The state at vertex / after 
t steps of ^ is denoted by the Boolean function ftj{x). The 
function ft (x) : ^" -> 3§" is state of ^ after f-steps (so this 
is an n-dimensional vector having ft,i{x) as its rth coordinate). 

Assumptions on Boolean network: We will analyze 
Boolean networks that arise from a particular distribution. 
First, the graph G is chosen from a random distribution. We 



will assume some fixed indegree distribution &. (This is 
simply a distribution on positive integers.) For each vertex 
/, we independently choose d^{i) from &. Then, we choose 
d^{i) uniform random vertices (without replacement) to be 
the in-neighborhood (N^{i)) of /. We denote the average 
indegree by K, and the maximum possible indegree by /C„n.v 

Next, we assume that there is a distribution 3^ on trans- 
fer functions. Formally, this is a union of distributions ^, 
where this family only contains Boolean functions that take 
d inputs. For each vertex /, we first choose an independent 
function T,(xi,X2,. . .jX^-j,)) from ,'^ci-U)- Sort A^^(/) (accord- 
ing to its label) to get vi , V2 , . . . , vj- (,) . Assign the vertex vy to 
input Xj of T,. This gives us the transfer function for vertex /. 

We will assume that the family ^ has either of the follow- 
ing properties: 

• Full independence: A random function in ."^ is gener- 
ated by taking an empty truth table, and filling in each 
entry independently with the same distribution. 

• Balance on average: A uniform random member of S^ 
evaluated on a uniform random input outputs +1 with 
probability 1/2. Formally, ?r^^^[z{x) = +1] = 1/2 = 
Pr,^,[T(x) = -l] 



Influences 

We now discuss some of our main definitions. The follow- 
ing is one of the most important concepts. 

Definition 3 For a Boolean function f and coordinate i, the 
influence of i on f, denoted by Inf,(/) is Pr;i[/(x) ^ f{x^'')]. 
The total influence off is L(Inf,(/) and the average influence 
offisl^L.mif). 

For a distribution ^ = Ud-3^d of transfer functions, the in- 
fluence of the distribution S'd is denoted by I{S^d)- Formally, 
^{^d) — ET£,Tnf,(T)]. Often, when the defintion of ^ is un- 
ambiguous, we write this as Id- 

Definition 4 The influence of / at time t on J^ , denoted by 
lnf,,(^), is E;c[A(/,(x),/f(x(')))]. The average influence at 
timer w ^LInfr,i(-^)- 

Claim 5 The average influence of .jV at time t can be ex- 
pressed as \ Li</ j<« Inf,(/r,y). 

Proof: Let us focus on lnf,(o/K). We choose a uniform 
random x and evolve jy from the states x and x''^ Let 
X{t,j) be the indicator random variable for the event that 
ft.jix) ^ /,,,(x«). Note that E,[;t(f,;)] = Inf,(/r,;). By the 
definition of Hamming distance and linearity of expectation, 

Inf,-(^)=E,[A(/(x),/(x«))] 

= E.[Ez(f,;)] -EE,[z(f,;)] =Llnf,(/r,;) 
j J J 

Averaging this equality over all / completes the proof. D 

We will need the following simple facts about influences. 
(This is a restatement of Proposition [2] 



Proposition 6 • Consider a function f : ^" —> ^. 
An edge of the Boolean hypercube ,9§ is called 
bichromatic if one endpoint is labelled +1 and 
the other is labelled —1. Then Y,i<d^^^i{f) = 
(# bichromatic edges) jl . 

Consider a distribution 3" over functions f ; SS — ^ £^. 
E [^ Inf; (/)] = (! /2''" ' ) ^ Pi-[e is bichromatic] 



i<d 



e-.eilge SfM'' 



• For any Boolean function f and input index i, 

Pr[(/(x) = l)A(/(x)^/(x«))] 

X 

= Pr[(/(x) = -1) A (fix) + /(x(')))] = InfK/)/2 



Proof: Consider all pairs (y,}''''), where the /th bit of y is 
1. These pairs form a partition of the hypercube and are ac- 
tually all edges of the hypercube parallel to the /th dimen- 
sion. The influence Inf,(/) is exactly the probability that a 
uniformly random x belongs to a bichromatic pair. Let B; be 
the number of bichromatic pairs. Noting that the total num- 
ber of edges parallel to the /th dimension is 2''^ ' , Inf,- (/) — 
Bi/l^'^"^. Summing over afl /, we get that Y.i<d^^^i{.f) ~ 
(# bichromatic edges) /2''^'. To deal with a distribution, we 
simply apply linear of expectation to this bound. 

Now for the second part. When does the event {f{x) = 
1) A (/(x) 7^ /(jc*-'')) happen? This happens when x belongs 
to a bichromatic pair, and f{x) = 1. Exactly half the member 
of bichromatic pairs have value 1 (or — 1). Hence, Prjp[(/(x) = 
l)A(/W^/(xW))]=InfK/)/2. 

We restate Theorem [T| for convenience. 

Theorem 1 The average influence at time t < t* for the 
Boolean network jY lies in the range {Y^dyiPd^dY ± 1/n'' . 



Proof strategy 

Before diving into the gory details of the proof of our main 
result, we offer a high level intuition of the overall proof. The 
proof essentially consists of two central steps. Lemma [9] and 
Claim [TT| discussed in Sections [] and [[respectively. Lemma|9] 
considers an idealized situation where the underlying graph 
of a Boolean network is simply a root-directed tree. Any 
modification of the state of a leaf travels up the tree, pos- 
sibly affecting the root. In this case, we can give an exact 
expression of the influence of leaves on the root. Our key 
next step is Claim[TT| which proves that for our distribution of 
graph topologies (which is closely related to the configuration 
model), the logarithmic-distance neighborhood of most nodes 
looks like a root-directed tree. Note that this does not imply 
that the graph itself decomposes into disjoint tree, since the 
trees rooted at each node are interconnected in complicated 
ways. 

These two steps are combined together for the final proof 
in Section [1 This proof makes heavy use of the linearity of 



expectation and some conditional probability arguments. It 
enables us to perform exact short-run analysis of the Boolean 
network by only considering local neighborhoods of an "av- 
erage" node. 

At the core of the proof of Lemmal9]is a straightforward in- 
duction argument. Consider a tree network, where we change 
the state at some leaf. The catch is that induction requires the 
family of transfer functions to satisfy the technical conditions 
of balance or full independence. This is one of the major in- 
sights of this work, since these conditions on transfer function 
families have generally been implicit in previous results. The 
proof forces us to make these conditions explicit. Section[]has 
the details. 



The proof of Claim 1 1 consists of a combinatorial probabil- 



ity calculation. We are generating our graph through the ran- 
domized process of choosing the (immediate) neighborhood 
for each node independently (and uniformly) at random. We 
show that the probability that the short-distance neighborhood 
of a node contains a cycle is extremely small. The formal 
proof is given in Section!] 



Influences on trees 

In this section, we will make some arguments about tree 
networks. Let T be a directed tree (so all edges point towards 
the root) with all leaves at the same depth h. We are interested 
in the influence of the leaf variables on the root r. Let the 
function giving the state of the root r at time t be /, ,-. Note that 
//j r is only a function of the leaves, since there is no feedback 
in this graph. Note that we are not particularly bothered with 
what happens in the leaves are time step 1 (since those values 
are not even defined). We are merely interested in how the 
values at the leaves will propagate up the tree. For each v, 
the transfer function T,, is chosen from 3 (technically, from 

Claim 7 Let T be a tree of depth h. For a leafl, let the path 
to the root be vq — i,vi,V2, ■ ■ ■ ,v/, — r. Then, E,57[Inff(//, ,-)] = 
n;=i E,5?^ [Iiifv,_j (Tv,)]. (We remind the reader that 1^ is either 
balanced on average or fully independent.) 

Proof: We prove by induction on the depth h. When 3' is 
balanced on average, we will also show that '^^Jyfhyix) = 
1] — 1/2. For the base case, set h — 1. Hence, all the leaves 
are directly connected to the root, and the set 3 has only 
one function Xy (for the root r). The probability (over x) that 
Tr(jc) = Tr(x(^') is exactly Inf^(Tr). Suppose 3 is balanced on 
average. Since /i,,- ^ T,-, Pr,^,;f[/i,,-(x) = 1] = Pr,^,.4Tr(x) = 
l] = l/2. 

Now for the induction step. Assume the claim is true for 
trees of depth h — \. We will denote the indegree of r by t/. The 
root r is connected to a series of subtrees Ti,T2,...,Tj of depth 
h — 1. The roots of each of these r i , r2 , . . . , r^/ are the children 
of r. For convenience, assume that £ ETi. Note that for ft 7^ 1 
and Vx, //j-i,r,,(x) — fh-i.ri,i^ )■ ^i^ '^^e final step, the func- 
tion evaluated is T^ (//,_ i^ri{x),fh-i,r2{^),- ■ ■ , fh- 1 ,r^ (x) ) . 



First, let us assume that !^ is fully independent (the 
proof is much easier in this case). The probability that 
//i-i.ri(-x) ¥^ fh-i.r^ix^^'') is, by the induction hypothesis, 
Yli=i Ej^pnfy. J (t,,.)]. Conditioned on this, what is the prob- 
ability that T, '(/,,_!.,, (x), . . . , ) ^ T,,(/,,_i.,, (x(^)), . . .)? Since 
each entry in the truth table of Ty is chosen independently, this 
probability is exactly E^rpnfv,^ , (Tv,J]. Multiplying, we get 
that E^^[Inf, (/,,..)] = ntiE,Hlnf,,',(Tv,)]. This completes 
the proof for this case. 

Now, we assume that 3^ is balanced on average. For con- 
venience, set random variable X,- = //,_i r,(jc), and X[ to be 
fh- i.ri (-^ ) ■ For a bit b, let S'i{b) denote the event that Xi = b. 
We set ^ to denote the event that Xi ^ X[. We use b as short- 
hand for a vector bi,. ..,bii of bits. The indicator x(b) is 1 
when t^(b) ^ Tv(^(i)). 

E^pnfK//,,.)] 
= E,5?[Pr[T„(//,_i_r, {x)Jh-\.n(A-,- ■ -^fh-i.rjx)) 

= E,^[PrK(Xi ,X2, . . . ,Xd) ^ tAX[,X2, . . . ,Xd)]] 

X 



-■^ALX (b) Pr[4 {bi)/\^] Pr[n <^' (b. 



!=2 

= Y.EAx{bWi{b,)A,^]Vr[f{S,{bi)]] 

h '^ ^ '=2 

= Y.^:AxCb)PrWiibi)/\:nP^[llE,^[£iibt)]] 

b '" "" '=2 

d 

= £E,^[;t(^)Pr[^i(/7i)A^]]nE,^[Pr[//,-i,n=^d] 
^{l/2)''-'Y,E,^[x{b)Pr[<^,ibi)An 

h 

The final step uses the induction hypothesis. Now, we use 
Proposition|2]todeal withPr;t[^i(^i)A^]] =Vx^[fh-i^,^{x) ^ 
b\ A (/ft-1,,-1 [x) = fh-\,ri {x'^'^"'))]- Let ^' be the distribution of 
transfer functions excluding T,.. 

E,^[Inf,(//,,.)] - {l/2f-'Y.^,^[x{b)Vr[S,{b,)A^]] 

h 

= (l/2)''^E.^[;^(^)Inf,(/.-,,.,)] 

b 

= E,r [Inf, (//,- u, )] ( 1 /2Y ^ E,_9;. [X m 

b 

= nE.^[Inf,,_,(T,,)]E,^[Pr[;t(^)]] 

i=\ b 

h-1 

= nE,^[Infv,_,(T,,)]E^[Infi(T,)] 

r=l 

=nE._^[hif,,_,(T,,)] 
1=1 



Consider a tree T where all leaves have fixed depth h. Set L 
to be the set of all leaves of r. Define Inf(r) =L^ez,Inff(//,.r). 
For a leaf £ that is a descendant of some vertex v, suppose 
the path between them is v'o = £,vi,V2,.--,Va = v. Define 
Infprod,,„ = nLiInfv,_.(T,,). 

Lemma 8 Lef ri,r2,. . . ,ra be the children of the root r. 
Let Ti be the subtree rooted at r,-. Then, E,^[Inf(r)] = 

L'E,5j[Inf„(T,)]E.^[Inf(7^-)]- 

Proof: Define L, to be the set of leaves of 7). 

E,^[Inf(r)]=E,^[£lnf^.(/,,,)] 

fcL 

= £E^[£lnf,(/,,,)] 

= ^^E,^[InfprodJ 

/ (leLj 

^Y^Y. E,5',[Inf,,(T,)]E.^[Infprod,_J 

i ieLi 

= ^E^JInf,,(T,-)] E E,^[Infprod,^,,] 

= £E5',[Inf,,(T,)]E^[Inf(7;-)] 

i 

D 
Consider the following randomized tree construction. First, 
we have a probability p^, for all positive integers d. We de- 
fine this construction recursively. Trees of height are just 
singleton vertices. To construct a tree of height h > 1, we first 
have a root r. We choose the number of children to be d with 
probability p^. Then, for each child, we recursively construct 
a tree of height h—1. 

Lemma 9 Consider a completely balanced tree T with root 
r and height h generated by the randomized procedure 
described above. Furthermore, let the transfer functions 
be chosen as described above. Then E[^^g7'Inf^(//, r)] ~ 

{Zd>xPdidY. 

Proof: For any vertex v, let Ty be the subtree rooted at v. We 
will show by induction on h, that for any vertex at height h, 

ni:ieTMt{fh.)] = {^d>iPdidf- 

For vertex v with /; = 0, we trivially have Infv(/o,v) = 1- 
Now consider v at some height h. Suppose v has d chil- 
dren vi,...,Vj/. For each child v,- of v, E5;,Infv;(Tv) = Id- 
By the induction hypothesis, E[Inf(rv,)] (which are identical 
for all /) has value exactly (X.d>\PdIdy^^^ ■ By Lemma ^ 
conditioned on v having d children E^^gj- Inf£(//, v)] ~ 
lKrfE5',[Inf,,(T,.)]E[Inf(7;,)] = Id{i:d>iPdIdf-' ■ By not- 
ing that the probability that v has d children is pj, 
nZleTMdfKv)]^{Zd>iPdId)'- n 

The topology of the Kauffman network 

We now prove some topological properties of the random 
graphs (which are effectively directed Erdos-Renyi graphs). 



We remind that reader that the indegrees for all vertices are 
chosen independently from the same distribution. 

Definition 10 The distance t in-neighborhood of / is denoted 
by Nj'j. This is the set of all vertices whose shortest path 
distance (along directed paths) to i is exactly t. We set 

We define t* = log«/(41og/r„„„). 

Claim 11 Fix a vertex i and let t < t*. Let Si denote the 
event that the subgraph induced by A^<,* ,■ is a directed tree 
with edges directed towards root i. Then, |A^<,* J < «'' and 
Pr(<^) > 1 - 1/xA. 



Proof: Let us start with an empty graph, and slowly add 
random edges in a prescribed order We begin with /, and then 
choose the incoming edges. This gives the set A^j^^-. Then, we 
choose all the in-neighbors of N^j. This is done by iterating 
over all vertices in A^j^^, and for each such vertex, selecting 
every vertex as a neighbor with probability K/n. This gives 
us N2-. Proceeding this way, we incrementally build up A^^^* ■. 
Notethat|A^<,,,|<<„ = ni/4. 

Consider the construction of A^<, ■. Every new element 
added to this set is a uniform random element from [n] . Con- 
sider a random sequence of «'''* elements chosen uniformly at 
random (with replacement) from [«]. The probabiUty that no 
element is repeated at least 



1- 



nV4 



,1/4 



If no element in A'^, ,- is repeated, then the subgraph induced 
by N^i j is a directed tree. D 



Proof of the Main Theorem 

We are now ready to prove our main result. 

Proof: (of Theorem fT) By Claim [5] the average influence 
at time t is (LiLjIiifj(/(., ■))/«. For a fixed vertex /, let us 
compute L;Iiif;(/r,i)- Denote this quantity by X. We apply 
Bayes rule to split E[X] into conditional expectations. 

E[X] = Pi{S'i)E[X\S'i\+Pi(Ii)E[X\4] 

Observing that X is always positive and applying Claim [TT] 
we getE[X] > (1 - l/^n)E[X\S'i]. Note that since Inf;(/,,,) < 
1, ^ < \N^i\ < n^l^- Hence E[X] > E[X\S'i]-n-^/^. We now 
obtain an upper bound applying Claim[TT| again. 

E[X] < Pr(^-)E[XK-] + {l/Vn)E[X\^] 
<E[X\Si\+n-^/'^ 

It only remains to determine E[X|<^]. Conditioned on (§"{, 
the induced subgraph on A/<, , is a directed tree. We apply 

D 



> exp(-Vn/n) > 1 - (\/n) 



-1 



Lemmal9]to complete the proof. 



